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ABSTRACT 

— “A 

This  Is  an  account  of  not  only  the  lambda-calculus  but  of  its  close 
relative,  the  combinatory  calculus.  It  begins  with  an  introductory  survey,  so 
that  no  previous  knowledge  is  required.  It  is  explained  why  these  are  of  such 
importance  for  computer  software.  The  account  is  brought  up  to  the  present 
time.  It  includes  the  shortest  and  simplest  proof '  of . the  Church-Rosser 
theorem,  which  is  not  yet  published  and  appeared  in  a  limited  printing  in 
August  1982.  It  includes  a  model  of  the  combinatory  calculus,  also  not  yet 
published  but  available  in  1982  in  a  limited  printing.  An  introduction  is 
given  to  some  revolutionary  new  developments  of  the  combinatory  calculus  for 
programming  computers. 
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SIGNIFICANCE  AMD  EXPLANATION 


Programming  for  a  computer  went  through  a  couple  of  major  changes  up  to 
the  Invention  of  FORT AN.  Since  then,  though  many  competing  programming 
languages  have  been  developed,  the  basics  of  programming  have  changed 
little.  In  the  last  half  dozen  years,  some  revolutionary  new  ideas  for 
programming  have  appeared,  involving  the  very  fundamentals  of  the  lambda- 
calculus  and  the  combinatory  calculus.  By  giving  an  account  of  these  from  the 
beginning,  it  is  intended  to  make  these  revolutionary  new  ideas  more  easily 
comprehensible . 
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HIGHLIGHTS  OP  THE  HISTORY  OP  THE  LAMB DA -CALC ULUS 


J.  Barkley  Roeaer 
Kleene-nesa  la  next  to  o£de L-nesa 

1.  EARLY  BEGI  WRINGS. 

The  lambda-calculus  originated  in  order  to  atudy  function*  more  carefully.  It  was 
observed  by  Frege  In  1893  (aae  van  Heijenoort  1967,  p.  355)  that  it  aufficea  to  reatrlct 
attention  to  function*  of  a  aingle  arguaent.  Por,  auppoae  you  wiah  a  function  to  apply 
to  A  and  B  to  produce  their  sum,  A  +  B.  Let  •  be  a  function,  of  a  aingle  arguaent, 
which  when  applied  to  A  alone  produces  a  new  function,  again  of  a  single  arguaent,  whose 
value  is  A  ♦  B  when  applied  to  B  alone.  Mote  that  M  is  not  applied  aiaultaneously 
to  A  end  B,  but  successively  to  A  and  then  Bi  application  to  a  alone  produces  an 
interaediary  function  #(A),  which  gives  A  B  when  later  applied  to  B  alone.  That 
is,  A  ♦  B  ■  (MCA) ) (B) . 

This  aathod  of  reducing  the  use  of  a  function,  "+",  of  two  arguments  to  proper  use 
of  a  related  function,  "M",  of  one  arguawnt  only,  is  often  referred  to  as  "currying" 
because  it  was  brought  into  prominence  by  the  writings  of  H.  B.  Curry,  obviously,  the 
aethod  can  be  extended  to  reduce  the  use  of  a  function  of  still  aore  arguments  to  proper 
use  of  a  related  function  of  one  arguaent  only. 

This  is  the  way  coaiputers  function.  A  program  in  a  computer  is  a  function  of  a  single 
arguaent.  People  who  have  not  considered  the  natter  carefully  nay  think,  when  they  write  a 
subroutine  to  add  two  number e,  that  they  have  produced  a  program  that  is  a  function  of  two 
arguswnts.  But  what  happens  when  the  program  begins  to  run,  to  produce  the  sum  A  +  B? 
first  A  is  brought  froa  aaaory.  Suppose  that  at  that  Instant  the  computer  is  completely 
halted.  What  remains  in  the  computer  is  a  program,  to  be  applied  to  any  B  that  night  be 
forthcoming,  to  produce  the  sum  of  the  given  A  and  the  forthcoming  B.  It  is  a  function 
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o: f  on*  argument ,  depending  on  the  given  A,  to  b*  applied  to  any  B,  to  product  tha  sum 
A  ♦  B.  It  is  Frege's  intermediary  function  *(A). 

Apparently  Frege  did  not  pursue  the  idea  further.  It  was  rediscovered  independently 
(see  SchBnfinkal  1924)  together  with  the  astonishing  conclusion  that  all  functions  hawing 
to  do  with  the  structure  of  functions  can  be  built  up  out  of  only  two  basic  functions,  K 
and  S.  Let  us  adopt  the  notation  that  has  been  in  vogue  since  then.  Instead  of  writing 
the  value  that  one  gets  by  applying  the  function  F  to  A  as  F(A) ,  we  write  (FA), 
(mission  of  the  outside  parentheses  will  be  usual.  Nhen  ears  than  two  teres  occur, 
association  shall  be  to  the  left,  thus  IMF  denotes  ((MM)F),  but  M(HP)  denotes 
(M(NP) )•  Then  the  sue  of  A  and  B  would  be  written  MB. 

The  functions  K  and  S  are  such  that 

(1.1)  KAB  -  A 

(1.2)  SABC  -  AC(BC)  . 

For  a  proof  that  all  functions  can  be  built  up  of  K  and  B,  one  can  consult  the 
original  SchSnflnkel  paper.  Or  one  can  consult  the  two  early  paper a,  Curry  1929  or  Curry 
1930. 

expressions  built  up  out  of  X  end  8  by  application  (that  is,  enclosing  pairs  in 
parentheses)  are  called  "combinators."  Ose  of  thesi,  end  study  of  their  properties,  is 
called  "combinatory  logic."  Soutiaas  these  teres  are  extended  to  apply  whan  tha 
axprassions  are  allowed  to  contain  variables,  or  indeteminates,  as  well  as  K  and  S. 

Suppose  we  have  two  functions,  F  and  G  (built  up  out  of  K  and  S,  of  course) 
such  that  by  eeans  of  (1.1)  and  (1.2)  one  can  show  that 

(1.3)  FX  -  GX 

for  each  X,  or  for  an  indeterminate  X.  This  means  that  F  and  G  take  tha  same  value 
whenever  they  are  applied  to  each  X  whatever,  and  ao  they  ought  to  be  the  same 
function.  That  is,  one  should  have 

(1.4)  F  -  0  . 
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In  general ,  one  cannot  prove  (1.4)  by  means  of  (1.1)  and  (1.2).  Curry  (aaa  Curry  1930) 
contrived  additional  axioaa  auch  that  one  can  prove  (1.4)  whenever  (1.3)  holds  for  each 
X.  A  system  like  this  is  said  to  have  the  extenaional  property. 

Curry  added  axioms  to  enable  him  to  prove  additional  equalities.  Did  he  go  too  far, 
so  that  now  any  two  functions  can  be  proved  equal  to  each  other?  He  did  not.  Indeed,  he 
was  careful  to  prove  a  weak  form  of  consistency,  in  that  there  are  many  pairs  of  functions 
that  cannot  be  proved  equal  to  each  other.  And,  especially,  X  •  S  cannot  be  proved. 

This  gave  a  workable  system,  which  illuminated  many  properties  of  functions.  For 
instance,  let  M  be  built  up  from  K,  S,  and  the  variable  x.  One  can,  using  only  K 
and  S,  build  up  a  function  F  such  that  one  can  prove  that 

Fx  -  M 

by  means  of  (1.1)  and  (1.2).  F  turns  out  to  be  a  mixed  up  combination  of  X's  and  S's. 

Just  from  looking  at  F,  one  would  not  have  the  least  clue  that  Fx  -  M  should  hold. 
Because  F  is  constructed  in  order  to  give  the  result  Fx  -  M,  it  follows  that 

(1.5)  FN  -  M[xi-N]  , 

in  which  M[xi-N]  means  the  result  of  replacing  each  occurrence  of  x  in  H  by  n. 

Church  (see  church  1932)  proposed  that  the  F  in  question  be  called  fccM.  In  this, 

N  is  intentionally  part  of  the  name  of  the  function,  so  that  by  inspection  you  can  see 
what  you  would  get  if  you  apply  the  function  to  x.  For  his  construct.  Church  decreed  that 

(1.6)  (IxM)H  -  M[x:-N]  ; 
this  accords  exactly  with  (1.5). 

If  one  starts  with  the  left  side  of  (1.6)  and  replaces  it  by  the  right  side,  this  is 
called  a  6-reduction.  One  is  equally  entitled  to  start  with  the  right  side  of  (1.6)  and 
replace  it  by  the  left  side  of  (1.6)i  this  is  callsd  a  9-expansion. 

Thus,  to  produce  the  •  that  we  had  earlier,  Church  would  use  Xx( *y(x+y) ) .  By 

(1.6) ,  one  would  have 

(1.7)  ( Xx(  Ay(xty) )  )A  ■  ly(A+y)  . 

By  (1.6)  again,  one  has 


(1.8) 


( ly(A+y) )B  -  A+B 


So  taking  Xx( ly(x+y) )  to  ba  B,  one  has 

(1.9)  MB  -  (  lx(  Xy(jH-y) )  )AB  -  A+B 
by  two  6-reductions . 

the  beauty  of  this  is  that  at  all  stages  of  the  process,  one  can  tell  by  a  simple 
inspection  what  the  reduced  form  of  BAB  is  going  to  be.  This  is  the  famous  lambda- 
calculus  of  Church.-  (Henceforth,  we  will  write  LC  for  "lambda- calculus.")  It  does  involve 
one  with  having  to  be  careful  about  free  and  bound  occurrences  of  variables.  In  )y(x+y), 
the  occurrence  of  *  is  free  and  both  occurrences  of  y  are  bound. 

One  has  to  be  careful  not  to  make  manipulations  which  change  free  occurrences  of  a 

✓ 

variable  into  bound  ones.  Thus,  suppose  one  writes  By.  In  this  configuration,  the 
observed  occurrence  of  y  is  free.  A  blind  adherence  to  (1.6)  would  give 

By  -  *y(y+y)  , 

so  that 

Bys  *  r+z  . 

This  is  certainly  not  what  is  intended  for  B.  The  trouble  is  that  the  y,  which 
originally  existed  as  a  free  occurrence  of  a  variable  in  By,  has  been  put  into  Ay(y+y) 
where  its  occurrence  is  now  bound.  Actually,  when  Church  enunciated  the  rule  (1.6)  he  was 
careful  to  impose  the  restriction  that  it  should  not  be  used  if  some  variable  with  free 
occurrences  in  N  should  have  those  occurrences  bound  in  M[x:-N).  (In  addition,  one  must 
now  understand  H(x>-N]  to  mean  the  result  of  replacing  each  free  occurrence  of  x  in 
M  by  M.)  In  order  to  cope  with  this  contingency.  Church  instituted  the  or-step 

(1.10)  lyM  “  lz(M[y«-zJ)  . 

NOTBs  To  avoid  confusion  of  free  and  bound  variables  here,  one  must  put  the  restrictions 
that  there  are  no  free  occurrences  of  a  in  M,  and  that  no  occurrence  of  z  in 
Mlyt-sl  that  resulted  from  replacing  an  occurrence  of  y  in  M  by  z  is  bound.  Now  we 

have 

(1.11)  By  -  (lx(lz(x+z)))y  -  Iz(yfz)  i 

the  intermediate  formula  is  got  from  the  left  one  by  an  a- step.  So  one  has 


(1.12)  «yx  »  (Ar(y+r))x  »  y+x  , 
which  is  just  what  •  is  supposed  to  do. 

If  we  can  get  from  M  to  N  by  a  succession  of  steps,  possibly  null,  each  of  which 
is  either  an  o-atep  or  a  ^-reduction  or  a  ^-expansion,  we  say  that  M  is  convertible  to 
N>  we  write  "M  conv  H“ .  if  m  conv  N  by  a  succession  of  steps,  none  of  which  is 
a  0-expansion,  we  say  that  H  is  reducible  to  N;  we  write  *M  red  N“. 

John  McCarthy  worked  several  ideas  of  the  1C  into  LISP.  He  clearly  recognised 
procedures  as  functions  of  one  argument.  In  LC,  such  functions  can  be  applied  to  each 
other  and  give  such  functions  when  applied,  in  LISP,  it  is  possible  to  apply  one  procedure 
to  another,  and  on  occasion  get  another  procedure. 

As  we  said  earlier,  the  X  and  s,  and  things  built  exclusively  of  then,  are  called 
eombinators.  Can  we  cooningle  eombinators  and  lambda-expressions?  Yes  indeed,  with  no 
trouble  whatever  and  indeed,  variables,  or  indeterminates,  nay  be  freely  included. 

Church  had  decided  that  one  should  font  AxM  only  in  case  there  are  free  occurrences 
of  x  in  M.  Thus,  Church  could  not  get  a  laabda-expression  to  correspond  to  x.  It 
could  be  done  if  one  relaxes  the  requirement  that  there  be  at  least  one  free  occurrence 
of  x  in  M  to  form  XxM.  so  the  LC,  as  originally  set  up  by  Church,  seems  a  trifle 
weaker  than  the  combinatory  calculi  of  SchSnf inkel  and  Curry.  For  present  day 
applications,  either  would  serve  perfectly  well  {this  takes  some  proving)  and  the 
difference  is  just  something  to  niggle  over,  quite  insignificant,  originally,  this  was  not 
known,  and  Rosser  (see  Rosser  1935)  invented  a  couple  of  other  combinators,  in  place  of 
X  and  S,  with  which  he  set  up  an  exact  equivalent  of  the  LC.  Like  Curry,  his  system  had 
the  extenaional  property  and  a  weak  form  of  consistency.  Hence  the  LC  has  these  attributes 
also. 

The  LC  (and  hence,  the  combinatory  calculi)  has  a  fixed  point  theorem. 

Given  a  function,  P,  one  can  find  a  •  such  that 

(1.13)  F*  -  ♦  . 

Proof .  Take 

•  ”  where  IxF(xx)  . 


-5- 


There  ia  an  obvious  functional  relationship  batman  •  and  P,  naaely 


(1.14)  •  -  yp  , 
where 

(1.15)  Y-  Xf  ( ( Xxf  (xx) )  (  ixf  (xx) ) )  . 

On  pp.  177-179  of  Curry  and  Pays  1958,  Y  is  cal lad  the  paradoxical  ccmbinator.  The 
property 

(1.16)  P(YP)  -  YP 

for  each  P  is  noted,  which  the  authors  thought  to  be  paradoxical.  The  property  (1.16) 
aakes  Y  useful  in  sosm  of  the  aodarn  treatments  of  coabinators.  Sea  p.  37  of  Turner  1979 
(first  citation). 

2.  A  dkbacib. 

The  LC  and  the  combinatory  calculi  were  fairly  proaqptly  embedded  in  systems  which  had 
some  of  the  earlier  attributes  of  logical  systems.  See  Church  1932  and  Curry  1934.  The 
results  turned  out  to  be  inconsistent.  This  was  first  proved  in  Kleen a  and  Rosser  1935  by 
a  variation  of  the  Richard  paradox.  Later,  Curry  got  a  simpler  proof,  related  to  the 
Russell  paradox.  See  Curry  1942.  This  has  the  following  simple  fora.  Suppose  we  have  the 
two  familiar  logical  principles i 

(2.1)  PUP 

(2.2)  (P  D  IP  DUD  0  (P  3  B>  , 

together  with  modus  ponsns  (if  p  and  P  D  Q,  then  Q).  We  undertake  to  prove  an 
arbitrary  proposition  A.  He  construct  a  4  such  that 

(2.3)  ♦  -  9  3  A  » 

to  do  this,  we  taka  p  -  Xx(x  o  A)  in  the  fixed  point  theorem.  By  (2.1),  we  get 

•  36. 

Applying  (2.3)  to  the  second  6  gives 

♦  3  (•  D  A)  . 

By  (2.2)  and  modus  ponens,  we  get 

•  3  A  . 
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By  (2.3)  ravaraad,  we  gat 


•  . 

By  modus  ponena  and  tha  last  two  formulas,  wa  gat 

A  • 

This  is  usually  rafarrsd  to  as  tha  Curry  Paradox,  by  analogy  with  tha  Russall  Paradox. 

Pitch  proposed  to  avoid  this  by  weakening  tha  LC  (or  equivalent  combinatory  calculus) 
so  that  the  fixed  point  theorem  fails.  He  also  weakened  modus  ponena  a  bit.  See  Fitch 
1936  and  Pitch  1952.  He  has  proved  consistency  for  his  system,  but  it  is  much  too  weak  to 
be  considered  as  a  foundation  for  mathematics.  He  and  his  students  have  continued 
intersdttently  to  the  present  to  come  out  with  improvements ,  but  it  still  remains  extremely 
weak. 

Ackermann  proposed  keeping  the  full  strength  LC,  but  badly  crippling  implication.  See 
Ackermann  1950  and  Ackermann  1953.  Ha  proved  the  consistency  of  the  system,  but  it  was 
hopelessly  weak  as  a  foundation  for  mathematics,  and  I  know  of  no  recent  interest  in  it. 

Curry  kept  the  full  strength  combinatory  calculus,  but  added  a  fragmentary  theory  of 
types  and  a  weakened  version  of  implication.  He  introduces  a  notion  of  functionality, 

P,  such  that  if  Z  is  a  function  and  X  and  ¥  are  types,  then  PXYZ  is  to  denote  that 
if  U  is  of  type  X,  then  ZU  is  of  type  T.  See  curry  1934  and  Curry  1936.  It  turned 
out  that  the  'natural"  axioms  for  functionality  lead  to  a  contradiction,  as  shown  in  Curry 
1955.  However,  by  imposing  suitable  restrictions,  all  is  well.  See  Curry  1956.  Since 
then,  Curry  and  his  students  have  made  extensive  developments.  Two  major  works,  Curry  and 
Pays  1958  and  Curry,  Hindley,  and  Seldin  1972,  are  landmarks.  Even  so,  adoption  of  the 
system  as  a  foundation  for  mathematics  has  not  progressed  at  all,  though  the  system  has 
some  capability  in  that  direction*  see  Cogan  1955. 
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3.  WHERE  DO  WE  GO  TROH  HERE? 

As  «•  Mid,  Pitch  and  Curry  ara  continuing  to  develop  their  systems.  However,  there 
is  no  likelihood  that  either  will  be  adopted  as  a  foundation  for  mathematics.  Originally, 
it  warn  expected  that  the  1C  or  a  combinatory  calculus  should  be  a  part  of  such  a  system. 
Surprisingly  enough,  the  LC  (or  a  combinatory  calculus)  has  turned  out  to  be  of  importance 
in  its  own  right.  So  one  has  to  ask  the  questions  that  are  asked  about  any  logical  system. 

1.  what  about  consistency? 

2.  What  about  completeness? 

3.  What  about  models? 

4.  What  about  the  connection  with  computers? 

At  the  time  when  the  IC  and  the  combinatory  calculi  were  being  developed,  one  did  not 
ask  the  fourth  question.  Computers  had  not  yet  been  invented! 

4.  WHAT  ABOUT  COHSISTBHCY? 

We  observed  earlier  that  the  IC  and  the  combinatory  calculi  have  a  weak  consistency, 
in  that  one  cannot  prove  that  both  of  any  pair  of  functions  are  equal  to  each  other. 

Fairly  early  on  (see  Church  and  Rosser  '936;  reproduced  in  Church  1941)  a  considerably 
stronger  form  of  consistency  was  proved  for  the  IC,  embodied  in  the  Church- Rosser  Theorem 
(referred  to  hereafter  as  C-R-T). 

Suppose  XQ  red  X.,  and  X0  red  X2>  Then  there  is  an  X3  such  that  both  X,  red  X3  and  X2 
red  x3. 

A  lambda- formula  is  said  to  be  in  normal  form  if  it  has  no  part  on  which  one  can 
perform  a  9-r eduction.  A  lambda-formula  X  is  said  to  have  a  normal  form  Y  if  y  is  in 
normal  form  and  X  conv  Y.  We  can  prove  the  following  theorem. 

If  X  has  a  normal  form  Y,  then  X  red  Y  and  Y  is  unique,  except  possibly  for  a 
few  cosmetic  a- steps¬ 
ons  proves  the  first  pert  of  this  by  induction  on  the  number  of  operations  in 
X  conv  Y.  The  idea  is  as  follows.  Suppose  one  goes  from  X  to  by 

a  0-expansion,  next  from  W1  to  W2  by  a  0-reduction,  and  finally  from  Wj  to  Y  by  a 
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second  8-reduction.  Then  w1  red  X  and  red  Y.  So,  by  C-R-T,  there  is  a  W  such 
that  X  red  W  and  Y  red  w.  But  Y  is  in  normal  fora,  so  that  in  the  reduction  from  Y 
to  V  there  cannot  be  any  8-reduct :  ons;  only  a-steps.  So,  except  for  cosmetic  uses 
of  Q-steps ,  y  is  W,  and  we  had  X  red  W.  Por  uniqueness,  suppose  Z  is  another  normal 
fora  of  X.  Then  it  is  also  a  normal  form  of  Y.  So  Y  red  Z.  But  Y  is  in  normal 
form.  So  the  reduction  froa  Y  to  Z  can  consist  only  of  o-steps . 

The  lambda-formulas  of  interest  mostly  have  normal  forms.  These  normal  forms 
constitute  a  foundation,  on  which  is  erected  an  elaborate  superstructure.  However,  each 
normal  form  has  its  own  individual  superstructure,  not  overlapping  the  superstructures  of 
the  other  normal  forms. 

Because  formulas  of  the  LC  can  be  identified  with  formulas  of  a  combinatory  calculus 
and  vice  versa,  there  are  superstructures  in  the  combinatory  calculus  corresponding  to 
those  of  the  LC. 

In  (1.1)  and  (1.2),  one  can  consider  going  from  left  to  right  as  a  reduction.  So  one 
can  look  for  parallels  to  C-R-T,  one  can  define  normal  forms,  etc.  There  was  much 
investigation  of  these  questions. 

The  original  proof  of  C-R-T  was  fairly  long,  and  very  complicated.  In  Newman  1942, 
the  point  was  made  that  the  proof  was  basically  topological.  Newman  generalized  the 
universe  of  discourse,  and  defined  a  relation  with  properties  similar  to  a  8-reduction.  He 
proved  a  result  similar  to  C-R-T  by  topological  arguments.  Curry,  in  Curry  1952, 
generalized  the  Newman  result,  with  the  intention  that  it  would  be  relevant  to  similar 
considerations  in  the  combinatory  calculi.  Unfortunately,  it  turned  out  that  neither  the 
Newman  result  or  the  Curry  generalization  entailed  c-R-T  in  the  intended  systems  because 
the  systems  did  not  satisfy  the  hypotheses  of  the  key  theorems.  This  was  discovered  by 
David  E.  Schroer,  whose  counterexample  is  recorded  in  Rosser  1956.  In  Schroer  1965  is 
derived  still  further  generalizations  of  the  Newman  and  Curry  results,  which  indeed  do 
entail  C-R-T  in  assorted  systems.  As  Schroer  1965  is  62?  typed  pages,  this  hardly 
contributes  to  the  cause  of  shorter  and  simpler  proofs  of  C-R-T. 
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Chapter  4  of  Curry  and  rays  1958  is  davoted  to  s  proof  of  C-R-T  for  tha  LC,  and  to 
rolstod  aattors.  It  is  not  reecanended  for  light  reading.  In  Hindley  1969  and  Hindley  1974 
are  discussions  of  proofs  of  C-R-T  for  tha  LC  and  systems  closely  related  thereto. 

These  various  proofs  all  stamsMd  generally  from  the  Nessan  approach,  with  an  emphasis 
on  the  topological  structure.  However,  lambda- formulas  and  combinators  have  a  marked, 
though  specialised,  tree  structure.  Hitschke  1973  used  the  tree  properties  a  bit  in 
deriving  a  proof  of  C-R-T.  Rosen,  in  Rosen  1973,  really  went  overboard.  He  worked  with 
general  cross,  and  relationships  between  them,  as  lots  of  things  have  a  tree  structure, 
his  results  have  applications  beyond  proving  C-R-T.  He  applies  his  results  to  the  extended 
McCarthy  calculus  for  recursive  definition  (see  McCarthy  I960),  and  verifies  a  conjecture 
in  Morris  1968.  He  also  applies  his  results  to  tree  transducers  in  syntax-directed 
compiling.  With  all  that,  the  proof  of  C-R-T  did  not  come  easy.  He  had  to  prove  C-R-T* s 
for  several  related  systems,  and  then  derive  the  C-R-T  for  the  LC  by  some  trickery. 

Meanwhile,  a  genuine  simplification  for  the  proof  of  C-R-T  had  come  in  sight. 

See  Martin-LSf  1972.  It  is  agreed  that  Martin- I/5f  got  some  of  his  ideas  from  lectures  by 
M.  Tait.  An  exposition  of  the  proof  of  C-R-T  according  to  Talt  and  Martin-Lfif  appears  as 
Appendix  1  in  Hindley,  Lercher,  and  Seldin  1972.  A  shorter  exposition  appears  on  pp.  59-62 
of  Barendregt  1981.  Ha  will  give  what  seems  to  us  a  still  shorter  and  more  perspicuous 
proof  of  C-R-T. 

Mhat  seems  to  be  the  main  difficulty  of  the  proof?  Let  us  look  at  the  minimal  case. 
Suppose  X0  has  two  parts,  (AxW^Vj  and  ( XxV^JVj.  Let  XQ  red  xA  by  performing 
a  8-reduction  on  (AxW^IV^,  for  i  *  1,2.  if  (Xxw^)v^  and  ( IxW^ )V^  reside  in  totally 
disjoint  parts  of  XQ,  there  is  no  trouble.  To  get  X3  we  perform  a  8-reduction  on  the 
(Xxw^V^  that  still  resides  in  X^  and  on  the  (XxW^IV^  that  still  resides  in  X2. 

Mote  that  the  reductions  from  X3  to  X3  and  from  X2  to  X3  each  use  exactly 
one  production. 

But  suppose  that  (AxW^JV^  is  part  of  .  X2  will  contain  ( XxW^JV^,  where  V3 
is  the  result  of  a  8-reduction  of  ( XxW^ )V^  inside  V^.  As  a  candidate  for  X3,  we 
perform  a  8-reduction  on  the  ( IxV^  )V3  of  X2.  How  about  getting  from  x^  to  X3? 
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Where  XQ  had  (XxW^)V^,  Xj_  will  hm  W^Ixi-Vj).  If  there  had  bean  only  one  free 
occurrence  of  x  in  W^,  then  W^ (xi»V^l  will  contain  a  corresponding  V^j  we  change 
this  to  Vj  by  a  9-reduction  on  ( ^2V2  'V2* 

contain  several  free  occurrences  of  x.  Then  w^ [xi“V^]  will  contain  several  v^'s.  He 
can  go  through,  and  change  thaw  one  after  another  to  Vj's,  which  will  result  In  Xj.  But 
there  is  no  way  we  can  get  from  x^  to  X3  by  a  single  ^-reduction. 

In  the  language  of  Barendregt  1981,  p.  54,  9-reduction  does  not  have  the  diamond 
property. 

The  difficulty  is  that  it  may  take  several  9-reductions  to  get  frow  X^  to  Xj.  This 
should  have  suggested  working  with  a  string  of  ^reductions,  instead  of  only  one.  Why  it 
took  wore  than  thirty  years  for  this  to  occur  to  anyone  is  a  mystery. 

If  we  call  a  9-reduction  or  a-step  a  step,  then  a  string  of  thaw  will  be  a  walk.  But 
we  cannot  allow  just  any  old  string.  What  we  are  aiming  for  is  that  if  XQ  walk  X^ 
and  XQ  walk  X2,  then  there  is  an  Xj  such  that  X^  walk  Xj  and  X2  walk  X3>  If 
we  put  the  right  restrictions  on  the  steps  allowed  in  a  walk,  we  can  do  this. 

We  frame  our  restrictions  for  a  walk  as  follows. 

1.  A  walk  may  contain  no  steps  at  all. 

2.  It  nay  contain  ar-steps  at  will. 

3.  If  a  number  of  parts  (IxW^V^  fail  to  overlap  at  all,  the  corresponding 
9-reductions  may  be  done  in  any  order. 

4.  Let  (IxW)V  be  reduced  to  Wtx:«V]  in  a  9-reduction  of  the  walk.  Inside  that 
part,  W(xs«V] ,  no  subsequent  9-reductions  may  be  performed  in  the  walk,  and  indeed 

no  9-reduction  of  all  of  W(xi“V] ,  in  case  it  has  the  requisite  structure  (which  it 
could).  However,  a-steps  may  be  performed  inside  W[xs-V). 

The  relation  — »  on  p.  60  of  Barendregt  1981  is  likely  closely  related  to  our 
notion  of  a  walk,  but  it  is  not  exactly  the  same.  For  the  key  lemma,  Barendregt  uses 
something  like  induction  on  the  number  of  steps  from  XQ  to  X^  whereas  we  use  induction 
on  the  number  of  symbols  in  XQ.  This  makes  quite  a  difference. 

We  need  a  lemma,  which  is  about  as  follows. 


Suppose  X  walk  Y.  Then  X(x:*P]  walk  Ytx:«Pl  by  a  completely  parallel  series 
of  0-reductions. 

To  see  this,  note  that  o-steps  do  nothing  to  the  free  occurrences  of  x. 

A  0-reduction  can  rearrange  the  free  occurrences  of  x.  Xt  can  even  replicate  then,  as 
would  happen  if  the  0-reduction  were  from  (  XyS)T  to  S[y s“T] >  if  there  are  several  free 
occurrences  of  y  in  S,  they  would  each  be  replaced  by  T,  and  any  free  occurrences 
of  x  in  T  would  be  thereby  replicated. 

So,  if  occurrences  of  P  are  put  for  the  free  occurrences  of  x  in  X,  a  completely 
parallel  series  of  0-reductions  is  possible,  and  all  it  will  do  is  rearrange  or  replicate 
the  P's  just  as  the  walk  from  X  to  Y  did  for  the  free  occurrences  of  x.  At  the  end, 
we  just  have  Y[x:“P]  as  the  result.  The  fact  that  the  restrictions  for  a  walk  were 
satisfied  in  going  from  x  to  Y  assures  us  that  they  will  be  satisfied  in  going  from 
Xtxi-p]  to  Y[xj“P] • 

Actually,  the  lemma  is  not  quite  true,  because  of  the  possibility  of  confusion  of  free 
and  bound  variables.  Already,  before  you  try  the  first  step  from  X[xt-P]  to  Y[xt«P] , 
you  could  be  in  trouble  if  some  of  the  free  variables  in  P  became  bound  when  P  was  put 
for  x  in  X.  However,  a  very  close  relative  of  the  lemma,  sufficient  for  our  purposes, 
is  true. 

lemma.  Suppose  x  walk  Y.  In  x,  change  all  bound  variables  by  a-steps  to  a  set  of 
distinct  variables  that  have  no  occurrences  in  X  or  P.  This  gives  x1 ,  for  which  there 
is  a  Y1  such  that  X1  walk  Y1  by  essentially  the  same  ^reductions  as  were  used  for 
X  walk  Y.  Then  X^tx:*?]  walk  Y1 [xi-P]  by  a  completely  parallel  series  of  0*reductions . 

we  first  note  that  there  will  be  no  need  for  o-steps  in  either  the  walk  from  X1  to 
Y1  or  from  X1{x:«PJ  to  Y^Ixj-P].  All  possibility  of  confusion  of  bound  variables  has 
been  sidestepped  in  changing  from  X  to  X1 ,  and  we  can  now  use  the  argument  given 
originally. 
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Mot*  that  this  Imh  la  vary  nearly  th*  aati*  aa  proportion  2.1.17(1)  on  p.  28  of 
Barendregt  1981.  It  la  alao  cloaaly  related  to  propoaltlon  3.1.16  on  p.  SS.  In 
Barendregt’ s  terminology,  our  lemm’  says  that  a  walk  la  aubatltutiv*. 

Diamond  Property.  If  XQ  walk  X3  and  XQ  walk  X2,  then  there  la  an  X3  auch  that 
x1  walk  X3  and  X2  walk  X3. 

In  other  worda,  there  la  an  X3  which  la  th*  fourth  vertex  of  th*  diamond,  with  a 
walk  along  each  edge. 

Proof  by  induction  on  the  number  of  aymbola  In  XQ. 

Caae  1.  If  X0  haa  a  aingla  aytnbol,  it  la  immediate. 

Caa*  2.  Let  XQ  be  Then  X^  auat  be  IxM^  for  i-1,2,  clearly  we  have 

M0  walk  for  i  “  1,2.  80  there  ia  a  m3  auch  that  walk  m3  for  i-1,2.  Take 
Xj  to  be  IxMj. 

Caa*  3.  Let  XQ  be  M0M0. 

Subcaaa  1.  Xl  ia  IIjMj  with  mq  walk  MA  and  M0  walk  H^,  all  for  i-1,2.  Then 
there  are  l*3  and  M3  with  M4  walk  M3  and  walk  MJt  both  for  i-1,2.  Take 

*3  -  m3m3. 

Subcaaa  2.  mq  ia  XyWQ,  X3  ia  W^yi-J^l ,  and  X2  ia  (XyW2>l»2.  By  restriction 
4,  the  laat  8-reduction  in  XQ  walk  X1  had  to  be  from  (  XyW^N^.  So  we  have  wQ  walk  W± 
and  NQ  walk  Mt,  both  for  i-1,2.  Then  there  are  W3  and  Nj  such  that  walk  W3  and 

Na  walk  N3,  both  for  i-1,2.  Then  we  have  X2  walk  OyWjJNj,  and  hence  we  take  X3  to 
be  M3(y«-N3l.  There  are  various  N^’s  in  W^Iyj-N^J,  but  they  are  non-overlapping.  So 
we  operate  on  each  in  turn,  and  have  X3  walk  Wj^yt-Nj) .  By  our  lemma,  W1ty:-M3]  walk 
W3  (yi-Njl .  the  latter  being  X3,  except  for  some  a-stepa.  Mow  the  steps  we  took  in  going 
from  Wjlyj-l^]  were  all  on  n3's  that  had  been  put  for  y*s  in  W3.  So  none  of  them 
could  violate  restriction  4  as  we  go  on  down  to  M3(y:-M3l  from  w3ty i-Nj] .  So  we  can  put 
these  two  walks  together  to  conclude  x3  walk  x3. 

Subcase  3.  Like  subcase  2,  except  with  X3  and  Xj  interchanged.  Make  suitable 
interchanges  in  the  proof  of  subcase  2. 


-13- 


Subcase  4.  M0  ia  iy«o,  and  XA  ia  for  i-1,2.  By  restriction  4,  the 

last  traduction  in  XQ  walk  had  to  ba  fro*  (  *y  )fc  »  both  for  1*1,2.  So  wa  have 


»o 

walk 

"i 

and 

»o 

walk  both 

for 

i-1,2. 

So  thara  are  W3 

and  N3  auch  that 

«1 

walk 

*3 

and 

»i 

walk  N}  both 

for 

i-1,2. 

There  ara  various 

^‘s  in  t^tyi-Xi],  but 

thay  ara  non-ova r lapping.  So  wa  operate  on  aach  in  turn,  and  have  X1  walk  , 

both  for  i-1,2.  By  our  leaaa,  V^lyt-Nj]  walk  W3 ty :“N31 ,  both  for  i*l»2,  except  for 
aoa»  o-steps.  X3  ia  MjCyi-Nj]  •  To  gat  from  X1  down  to  X3,  wa  have  to  coabine  two 
walks,  both  for  i-1,2,  but  the  argument  for  this  goes  as  in  subcase  2. 

Now  wa  prove  something  that  looks  like  C-R-T. 

If  XQ  goes  to  X^  by  a  succession  of  walks,  both  for  i«l ,2,  than  thara  ia  an 
Xj  auch  that  X^  goes  to  X3  by  a  succession  of  walks,  both  for  1*1,2. 

The  proof  ia  so  aaay  that,  if  wa  carry  out  the  details  for  a  special  case,  the  whole 
thing  bacoaiaa  obvious.  So,  lat  XQ  walk  walk  W2  walk  X^  and 
XQ  walk  W3  walk  X^.  By  the  Diawond  Property,  wa  can  fill  in  W1's  to  ba  corners 
in  figure  1. 


Figure  1 

As  each  0-reduction  or  a-atep  taken  alone  ia  a  walk,  C-R-T  follows  by  the  previous 
result. 

Although  the  proof  in  Newsan  1942  failed  to  prove  the  C-R-T  for  the  LC,  it  does  prove 
a  C-R-T  for  a  fairly  general  universe  of  discourse.  Some  cases  of  this  have  been  found  to 
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be  useful,  though  they  have  not  such  in  common  with  thu  lc.  Saa  Book  1982,  and  several  of 
tha  other  authors  cited  in  bibliographic  rafarences  in  Book  1982. 


5.  WHAT  ABOUT  C0HPL1TEHBS8? 

At  first  sight,  it  appears  that  the  LC  la  so  weak  that  it  is  absurd  even  to  raise  the 
qua st ion.  However,  as  indicated  in  Church  1932  and  amplified  in  Kleene  193S  ,  the  positive 
integers  can  be  defined  in  tha  LC.  If  n  is  a  positive  integer,  we  let 

lf(Xx(f(f(...(f(fk)). ..)>))  , 

where  there  are  n  f's,  denote  the  integer  n.  This  aakes  one  form  of  recursive 
definition  easy.  If  F(n)  la  to  be  defined  by 

(5.1)  F(l)  -  GA 

(5.2)  F(n+1)  -  G(Fn)  , 

than  we  can  take  F  to  be 

(5.3)  In(nCSA)  . 

With  this  definition, 

FI  red  GA 
F2  red  G(GA) 

F3  red  G(G(GA) ) 
etc. 

However,  there  is  no  aero  in  this  ays tea.  One  would  prefer  the  recursive  definition 
to  be  given  by 

(5.4)  F( 1)  -  A 

(5.5)  F(n+1)  -  G(Fn)  . 

In  Kleene  1936,  Kleene  worked  out  a  way  to  do  this.  This  opened  the  door  to  still  aore 
general  recursive  definitions.  More  and  aors  definitions  of  functions  tram  integers  to 
integers  were  discovered.  So aa  never  published  investigations  by  Rosser  disclosed  so  aany 
that  in  about  1934  Church  was  led  to  conjecture  that  every  effectively  calculable  function 
froa  positive  integers  to  positive  integers  is  definable  in  the  LC.  It  was  known  froa 
Church  and  Rosser  1936  that  every  function  froa  positive  integers  to  positive  integers  that 


is  definable  in  the  LC  is  effectively  calculable.  So  Church  enunciated  what  is  now  known 


as  "Church’s  Thesis." 

Church's  Thesis.  Effectively  calculable  functions  from  positive  integers  to  positive 
integers  are  just  those  definable  in  the  LC. 

As  "effectively  calculable"  is  an  intuitive  notion.  Church's  Thesis  is  not  susceptible 
of  proof.  However,  it  states  a  strong,  and  quite  unexpected,  version  of  completeness. 

In  about  this  ers,  GSdel  and  Kleene  were  trying  to  get  a  definition  for  "general 
recursive  function."  Kleene  gives  a  definition  in  Kleene  1936.  He  attributes  it 
to  GSdel.  Gddel  thought  that  general  recursiveness  should  be  taken  as  the  criterion  of 
effectively  calculable.  However,  in  Xleene  1936  it  is  shown  that  general  recursiveness  is 
the  same  as  being  definable  in  the  LC.  This  lent  strong  support  to  Church's  Thesis. 

Independently,  Turing  had  been  developing  the  abstract  idea  of  a  computer ,  the 
so-called  "Turing  machine. ”  See  Turing  1936.  Turing  thought  that  "effectively  calculable" 
should  be  taken  to  be  the  same  as  calculable  on  a  Turing  machine.  But  in  Turing  1937,  he 
proved  that  that  is  the  same  as  being  definable  in  the  LC.  This  result  explains  why  the 
lambda-calculus  and  the  combinatory  calculi  can  (and  do)  play  such  an  important  role  in  the 
theory  of  computer  programming,  and  such  matters. 

Independently,  in  Post  1936,  Post  had  developed  ideas  very  similar  to  those  of 
Turing.  Turing  published  first,  by  a  very  few  months.  Later,  in  Post  1943,  still  another 
definition  of  "effectively  calculable"  was  proposed,  which  turned  out  to  be  equivalent  to 
those  already  given.  Still  later,  in  Harkov  1951,  Markov  gave  yet  another  definition, 
which  was  also  proved  to  be  equivalent.  A  translation  of  this  appears  as  Harkov  1961.  In 
Smullyan  1961,  using  his  "elementary  formal  systems,"  still  another  definition  is  given, 
which  is  also  equivalent. 

With  the  development  of  actual  computers,  which  are  finite  approximations  for  a 
universal  Turing  machine,  interest  in  all  these  matters  has  been  much  intensified,  in 
Kleene  and  vealey  1965,  on  p.  3,  the  authors  list  150  contributions  to  the  subject  by 
October  15,  1963.  By  now  there  are  far  more. 
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K1 sene's  early  developments  In  recursion  theory  were  of  such  importance  for 
computing.  However,  though  he  still  uses  many  notations  from  the  LC,  he  has  diverged  far 
from  it  into  an  area  that  is  now  essentially  of  no  use  in  computing,  though  active  and  of 
interest  to  many  people. 

There  have  been  some  objections  to  church's  Thesis.  In  Moachovakia  1968  is  given  a 
simultaneous  review  of  four  papers,  by  Jean  Porte,  Ldssl 6  Kalmdr,  Rdzsa  ptfter,  and  Elliott 
Mendel eon.  The  first  three  papers  attempt  in  various  ways  to  discredit  Church's  Thesis. 

The  paper  by  Mendelson  discusses  the  first  three  papers,  and  undertakes  to  show  that  their 
criticisms  are  illfounded.  In  the  opinion  of  the  reviewer,  he  succeeds  quite  adequately. 

I  know  of  no  recent  attacks  on  Church's  Thesis,  and  it  seems  to  be  generally  accepted  as  an 
important,  if  unorthodox,  version  of  completeness  for  the  LC. 

6.  WHAT  ABOUT  MODELS 7 

There  is  a  classic  theorem  that  says  that,  if  a  logic  is  consistent,  it  will  have  a 
model}  Indeed  a  denumerable  one.  However,  the  LC  is  so  different  in  structure  from  the 
usual  logics  that  the  theorem  does  not  apply  to  it. 

Why  does  one  wish  a  model?  If  one  has  a  framework  with  a  lot  of  structure,  and  the 
logic  is  isomorphic  to  some  pert  of  the  framework,  then  the  structure  in  the  framework  can 
contribute  to  your  understanding  of  the  logic.  One  can  always  manufacture  a  very 
superficial  model  by  taking  equivalence  classes  of  objects  in  the  logic.  The  only 
structure  this  has  is  what  is  forced  on  it  by  the  logic  itself.  So  no  additional 
understanding  can  come  from  studying  the  structure  of  the  model.  Such  a  model  does  little 
good. 

For  a  very  long  time,  this  was  the  only  kind  of  model  that  was  found  for  the  LC. 
Finally,  with  encouragement  from  strachey,  Dana  Scott  hit  on  a  way  of  making  some  really 
useful  models.  They  could  be  constructed  either  in  the  category  of  topological  spaces  or 
in  the  category  of  lattices.  An  exposition,  "Outline  of  a  mathematical  theory  of 
computation,"  appears  in  pp.  169-176  of  the  Proc.  Fourth  Annual  Princeton  Conf.  on 
Information  Sciences  and  Systems,  1970.  In  case  this  is  inaccessible,  another  exposition 
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appears  as  the  final  article  in  Engelar  1971.  In  Barondragt  1981  is  given  a  model  similar 
to  the  Scott  one,  but  in  a  still  sore  general  framework,  namely  the  category  of  complete 
partial  orders.  In  one  sense,  this  is  good  since  one  can  derive  still  sore  properties  of 
the  XC  in  this  sore  general  category.  However,  suppose  one  would  like  just  to  see  a  model 
without  having  to  learn  all  the  algebra  involved  in  topological  spaces,  lattices,  or 
complete  partial  orders.  Some  people  have  been  working  in  that  direction,  to  get  a  model 
without  all  the  algebraic  baggage.  This  is  mostly  available  only  in  unpublished  material, 
such  as  Plotkin  1972,  Sngeler  1979,  and  Meyer  1982>  the  latter  gives  a  fairly  coaq>lete  and 
coherent  account.  According  to  Meyer,  the  model  originated  with  Plotkin,  was  improved  by 
Sngeler,  and  further  improved  by  Meyer  himself.  Our  account  is  taken  from  the  Meyer  paper. 

Start  with  a  nonempty  set.  A;  the  unit  class  consisting  of  the  ordered 
pair  < 4, will  do,  where  +  is  the  null  class.  Enlarge  A  to  the  least  set  B 
containing  A  and  all  ordered  pairs  <  0, b> ,  where  0  is  a  finite  subset  of  B  and  b 
is  in  B. 

The  model  consists  of  all  subsets  of  B.  For  two  members,  C  and  D,  of  the  model, 
define 

(«.l)  (CD)  -  (b€B|<  0,b>  SC  and  0C  d}  . 

To  show  that  this  contains  a  model  of  the  combinatory  calculus,  we  Identify  two 
elements  K  and  s> 

(6.2)  K  -  {<a,<0,b>>  |bea  and  a,  0  finite  subsets  of  b}  . 

(6.3)  S  -  {<a,<0,<Y,b»>|bcay( 0y)  and  a,  0,y  finite  subsets  of  b}  . 

One  verifies  fairly  easily  that 

(6.4)  KCD  -  C 

(6.5)  SCDE  -  CE(DE) 

for  all  elements  of  the  model.  A  close  relative  of  the  extensional  property  holds;  see 
Meyer  1982. 

Since  the  1C  is  so  closely  related  to  the  combinatory  calculi,  it  is  not  surprising 
that  something  very  similar  can  be  put  together  as  a  model  for  the  LC.  In  Meyer  1982  there 
are  full  details. 
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7.  WHAT  ABOUT  THE  COMMBCTIOM  WITH  COMWTHU)? 


This  proceeds  in  two  directions.  One  esn  use  computers  to  manipulate  coabinetors  or 
foraulas  of  the  1C,  or  one  can  use  properties  of  coabinetors  and  the  L£  to  help  in 
progrma)  ng  or  to  develop  ideaa  of  use  for  coaputer s. 

booking  to  the  first/  the  obvious  approach  would  be  to  represent  the  coabinetors ,  or 
foraulas  of  the  1C,  as  lists  or  arrays  in  the  coaputer  aeaory.  Zn  fact,  these  foraulas  are 
tree  structures,  and  might  better  be  represented  so  on  the  coaputer .  Knowing  the  location 
of  only  the  root  of  the  tree  then  suffices  to  reconstruct  the  entire  tree.  So  the  trees 
(entire  foraulas)  can  be  Identified  by  single  aeaory  locations,  instead  of  by  elaborate 
diagraas  or  linearisations  thereof. 

The  idea  la  very  siaple.  Suppose  A  and  B  are  coabinetors,  and  we  have  put  their 
roots  at  aeaory  locations  a  and  b.  Then  we  represent  C  ■  (AB)  by  locating  its  root  at 
aeaory  location  c;  in  c  we  put  the  ordered  pair  of  nuabers  a  and  b.  The  person  who 
wishes  to  know  the  structure  of  C  is  told  to  look  at  location  c.  There  he  finds 
<a,b>,  which  tells  hie  that  C  has  the  fora  (AB),  and  that  to  know  the  fora  of  A  ha 
should  look  in  location  a,  and  siailarly  for  B. 

Besides  the  convenience  in  referring  to  a  foraula,  this  allows  economies  of  aeaory 
which  are  not  possible  when  a  formula  is  represented  by  a  list.  For  an  extreme  example, 
suppose  B  *  ((AA)(AA)),  where  a  requires  1000  aeaory  locations  for  its 
representation.  To  represent  B  as  a  list  would  require  four  repetitions  of  the  listing 
of  A,  together  with  attendant  parentheses;  a  total  of  4006  locations.  With  the  tree 
representation,  let  A  have  its  root  at  at  we  aay  still  suppose  that  the  entire 
representation  of  A  fills  1000  locations.  At  some  convenient  aeaory  location,  d,  we 
put  <a,a>,  which  denotes  D  "  (AA).  Then  at  another  empty  aeaory  location,  b,  we  put 
<d,d>,  which  denotes  (DO) .  But  (DO)  is  ((AA)(AA))  •  B.  Thus,  with  A  represented  in 
1000  aeaory  locations,  we  require  only  1002  locations  to  represent  ( (AA) (AA) ) . 

Aother  advantage  of  the  tree  representation  is  that  it  lends  itself  to  what  is  called 
"lasy  evaluation."  Suppose  a  part  M  occurs  several  times  in  a  foraula  x.  If  X  is 
represented  as  a  list,  the  several  occurrences  of  M  are  each  written  out  in  full.  Unless 
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extraordinary  measures  are  taken,  each  of  the  occurrences  of  M  will  be  evaluated 
separately,  and  independently,  in  the  course  of  evaluating  X.  However,  with  a  tree 
structure,  M  will  occur  only  once,  but  with  various  pointers  'pointing*  to  it.  Hence,  it 
will  be  evaluated  once  only. 

These,  and  many  related  matters,  are  taken  up  in  Petsnick  1970.  Consider  a  typical 
program  on  a  computer,  say  for  computing  an  approximation  to  the  square  root  (two  integers, 
a  mantissa  and  an  exponent).  If  one  inputs  an  approximation  for  a  real  number  (a  mantissa 
and  an  exponent)  the  program  will  generate  and  output  an  approximation  for  the  square  root. 
So  the  program  defines  a  function.  Naturally,  it  is  a  computable  function.  So  (by  one  of 
the  equivalences  supporting  Church's  Thesis)  this  function  mi  st  be  expressible  by  mans  of 
a  combinatory  formula.  If  suitable  hardware,  or  software  simulations  thereof,  is 
available,  the  calculation  can  be  done  solely  by  combinatory  manipulations. 

Soswthing  of  the  sort  had  been  proposed  for  lambda- formulas  by  Landin.  For  this 
purpose,  he  defined  and  used  what  he  calls  SECD  machines.  See  Landin  1965  or  "A  formal 
description  of  ALGOL  60,"  pp.  266-294  in  Steel  1966.  However,  this  involved  him  in  a  very 
difficult  problem  of  handling  the  complicated  substitutions  properly.  If  he  had  used 
combinatory  formulas  instead,  this  problem  would  be  much  simplified.  Also,  Landin  tried  to 
superpose  the  lambda-formulas  on  top  of  the  usual  computer  software.  This  produced  a 
greatly  complicated  assignment  problem.  If  one  would  dispense  with  the  usual  computer 
software,  and  work  only  with  combinatory  formulas  stored  in  the  memory  (preferably  as 
trees)  the  assignment  problem  would  simply  disappear. 

Petxnick's  thesis,  Petsnick  1970,  showed  that  it  is  possible  to  design  a  computer  to 
work  exclusively  with  combinatory  formulas,  stored  as  trees.  There  is  no  assignment 
problem,  and  application  takes  the  place  of  substitution.  As  application  is  the  basis  of 
the  tree  structure,  it  is  handled  automatically.  The  hardware  one  would  have  to  build  to 
handle  this  would  be  quite  simple,  or  it  can  be  handled  with  present  hardware  by  a 
suitable  software  simulation. 

Petsnick' s  thesis  managed  to  evade  everybody's  attention,  and  nothing  mor >  was  done  in 
that  area  for  a  while.  But  after  some  years,  work  similar  to  Petsnick's,  and  extending  it. 
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began  to  appear,  and  has  quickly  blossomed.  It  now  engages  the  attention  of  a  conaiderable 
number  of  people,  all  of  whoa  aeaa  to  be  quite  unaware  of  Petsnick's  work. 

There  ia  quite  a  ferment  of  activity  juat  now,  and  several  papers  were  presented  at 
the  1982  ACM  Symposium  on  LISP  and  Functional  Programming  at  Pittsburghi  a  set  of 
Proceedings  is  available  under  ACM  order  number  552820.  It  would  surpass  ay  powers  as  a 
soothsayer  to  determine  what  will  emerge  as  the  key  ideas i  perhaps  soma  have  not  yet 
emerged. 

I  will  sketch  a  couple  of  trains  of  development,  to  give  the  reader  some  sort  of  idea 
what  is  happening.  In  so  doing,  I  may  fail  to  note  something  that  will  be  of  major 
importance,  and  so  fail  to  give  credit  due  to  those  who  are  working  on  it. 

In  Henderson  and  Morris  1976  appeared  an  idea  for  lasy  evaluation.  The  two  papers, 
both  cited  as  Turner  1979,  carried  this  forward,  and  also  showed  how  to  condense 
combinatory  formulas  very  much,  thereby  alleviating  what  had  been  a  problem  for  Petsnick. 
More  on  that  last  point  is  given  in  Hughes  (to  appear).  There  are  now  programs  for 
manipulating  combinators  directly.  One  is  given  in  CRS/1.  Another  is  SKIM,  which  was 
announced  in  1980,  and  is  now  being  improved  by  a  group  at  Cambridge  university.  Backus 
1978  does  not  seem  to  be  in  the  main  stream  of  this  activity,  but  it  has  some  quite  novel 
combinatory  functions,  and  something  interesting  nay  evolve  out  of  it. 

It  seems  to  be  now  established  that  operating  directly  on  computers  in  combinatory 
format  is  not  only  feasible,  but  has  seme  advantages.  Even  more  useful  results  may  be  just 
around  the  corner.  Or  they  may  have  already  been  announced  without  my  appreciating  their 
worth. 
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